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Abstract 

Chromosome evolution has been demonstrated to have profound effects on diversification rates and speciation in 
angiosperms. While polyploidy has predated some major radiations in plants, it has also been related to decreased 
diversification rates. There has been comparatively little attention to the evolutionary role of gains and losses of single 
chromosomes, which may or not entail changes in the DNA content (then called aneuploidy or dysploidy, respectively). In 
this study we investigate the role of chromosome number transitions and of possible associated genome size changes in 
angiosperm evolution. We model the tempo and mode of chromosome number evolution and its possible correlation with 
patterns of cladogenesis in 15 angiosperm clades. Inferred polyploid transitions are distributed more frequently towards 
recent times than single chromosome gains and losses. This is likely because the latter events do not entail changes in DNA 
content and are probably due to fission or fusion events (dysploidy), as revealed by an analysis of the relationship between 
genome size and chromosome number. Our results support the general pattern that recently originated polyploids fail to 
persist, and suggest that dysploidy may have comparatively longer-term persistence than polyploidy. Changes in 
chromosome number associated with dysploidy were typically observed across the phylogenies based on a chi-square 
analysis, consistent with these changes being neutral with respect to diversification. 
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Introduction 

While variation in chromosome number is widespread among 
plants, its role in species diversification has long been debated [1- 
3]. Transitions in chromosome number comprise the multiplica- 
tion of a whole chromosome set (which may entail a whole genome 
duplication, WGD, or an increase by half of the genome, 
demipolyploidy, which can occur when one homeologous set of 
chromosomes is duplicated in an already existing polyploid [4]), 
and changes in single chromosome number (resulting from fission, 
fusion, duplication or deletion of single or few chromosomes; [5]). 
Demipolyploidy is thought to occur through the crossing of 
gametes of different ploidy levels (e.g., a tetraploid crossed with a 
diploid, followed or preceded by a doubling of the genome, yields 
a hexaploid, which is inferred as a 4 x to 6 x demiploid transition). 
Researchers have long argued about the prevalence of polyploidy 
in flowering plants as well as its evolutionary and genomic 
consequences [3] . Recent works have demonstrated that polyploi- 
dy is ubiquitous in angiosperms and has played an important role 
in many lineages, with evidence of several rounds of both ancient 



and recent polyploidizations [6—11]. The terms paleo- and 
neopolyploid have been loosely used to describe the relative time 
of the polyploidy event - either as ancient (paleo) or recent (neo). 
However, what is ancient and recent varies among authors (e.g., 
neopolyploidy may be defined as a polyploidization event within a 
genus or within a species; [12]). In addition, there are still 
conflicting opinions on whether there is a positive or negative 
relationship between ancient polyploid events and diversification 
rates. Whereas polyploids have traditionally been regarded as 
evolutionary dead ends [1,13-14], recent studies have suggested 
correspondence between polyploid events and diversification of 
some of the most species-rich angiosperm lineages [3,15], 
Polyploidizations are rather frequent, but newly formed polyploid 
lineages generally fail to persist, which would explain their low 
diversification rates and the biased distribution of polyploidy 
towards terminal branches of the plant tree of life [10,17]. 
Whereas neopolyploids are widespread among angiosperms (an 
estimated ca. 30% of extant species have polyploidized since their 
genus arose [8,10]), paleopolyploidization events are compara- 
tively very rare [9,11,17]. Thus, although polyploidy may initiate 
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rapid diversification within distinct evolutionary lineages (i.e. seed 
plants and angiosperms [9]), it is generally associated with 
decreased diversification rates and higher extinction risk [8,10]. 
Polyploid lineages may have succeeded in becoming established 
during past periods of great environmental upheaval and mass 
extinction events, which created new ecological niches and 
disturbed habitats [17-19]. The surviving polyploid lineages could 
have an enhanced potential for diversification due to their genetic, 
genomic and epigenetic features [2,11,16,20-24], thus attaining 
long-term evolutionary success [10—11,17,19]. A parallel situation 
may currendy be depicted by the high proportion of polyploid 
species in harsh environments like those in high altitudes or 
latitudes [25-26]. 

In this study, we consider gains and losses of single chromo- 
somes as processes that entail (i) change in DNA content 
(aneuploidy: duplication or losses of chromosomes), or (ii) little 
or negligible change in DNA content (dysploidy: fission and/or 
fusion; Supporting Information SI). Despite the prevalence of 
gains and losses of single chromosomes among angiosperms [27], 
the evolutionary role of these chromosome number transitions has 
received far less attention than that of polyploidy. Some studies 
have suggested an important role of gains and losses of single 
chromosomes in species diversification [28-33], but there has been 
no formal test of this role. 

A central issue in the study of chromosome number evolution 
concerns the relative importance of polyploid transitions versus 
gains and losses of single chromosomes [24]. While polyploidy 
takes shape as a more dramatic transition — one that involves the 
whole genome - in practice, the persistence of polyploid lineages is 
widely believed to be higher compared to aneuploid ones [24] . In 
contrast, to our knowledge, the evolutionary persistence of lineages 
affected by dysploidy has not been studied. 

Here, we perform a phylogenetic comparative analysis of 
chromosome evolution and lineage diversification in 15 flowering 
plant clades to estimate the relative importance of polyploidy and 
gains and losses of single chromosomes in the evolution of 
angiosperms. The particular aims of this study are to evaluate (i) 
previous hypotheses about the role of polyploidy in angiosperm 
diversification; (ii) the persistence of gains and losses of single 
chromosomes (including a priori aneuploidy and dysploidy) along 
angiosperm evolution; and (iii) the relative distribution and timing 
of polyploidy and gains and losses of single chromosomes across 
angiosperm phylogenies. 

Materials and Methods 

Phylogenetic sampling 

We used molecular phylogenies from 15 angiosperm groups 
(Supporting Information SI; Material SI) belonging to both dicots 
(10) and monocots (5), and displaying both monocentric and 
holocentric chromosomes (Table 1; Supporting Information SI). 
The four Cyperaceae datasets included in this study were treated 
independendy: the Cariceae dataset represents the main lineages 
of the tribe and comprised 1-2 species per each Carex section, 
whereas the three Carex sections (Ovales; Phacocystis, and Spiros- 
tachyae) included a high percentage of the extant species of each of 
those sections. Each co-author provided 1-2 datasets for which 
they were experts, and which had a reliable published phylogeny, 
information to calculate absolute times of divergence and 
chromosome number variation (Supporting Information SI; 
Material SI). Taxonomic level of the phylogenies was heteroge- 
neous, comprising families (two), tribes (three), genera (five) and 
sections (five). Molecular phylogenies with branch lengths scaled to 
the number of nucleotide substitutions were obtained from the 



authors of the original published studies (Supporting Information 
SI; Material SI). The trees were rooted using outgroup species or 
clades as specified in the original studies and dated using the 
penalized likelihood method [34] as implemented in the APE R 
package [35-36] . The most appropriate smoothing parameter was 
chosen based on an initial cross-validation run and calibration 
points were included to transform trees from relative to absolute 
times. For the Orchidinae dataset, an ultrametric tree previously 
generated using BEAST [37] was directiy obtained from the study 
author (Supporting Information SI; Material SI). Chromosome 
counts (used for the reconstruction of chromosome number 
transitions) were obtained for each group from different sources 
(Supporting Information SI; Material SI). Tips of the phylogeny 
without available chromosome counts were pruned. All our 
datasets had: (i) sequence data for at least 12 species (maximum 
100 species); (ii) chromosome counts available for at least 75% of 
the species sampled in the phylogeny, and (iii) variation in 
chromosome numbers among the species remaining in the 
phylogeny (Table 1). 

Tempo and mode of chromosome evolution 

Given the dated molecular phylogenies and the assignments of 
chromosome numbers to the tips, we aimed to infer the location of 
chromosome number transitions using the ChromEvol methodol- 
ogy [4] . This likelihood-based method assesses the fit of several 
models that allow for various types of chromosome number 
change along the phylogenies and infers the type of each transition 
in chromosome number (WGD, demipolyploidy, increase and 
decrease in a single chromosome; Table 2) along the branches of 
the tree. We ran all eight available models and used the Akaike 
information criterion (AIC [38]) to select the best model for each 
dataset. Models of chromosome evolution in ChromEvol are based 
on two (gain and loss of a single chromosome) to six parameters 
(polyploidy, demipolyploidy, gain and loss of a single chromosome 
and gain and loss of a single chromosome proportional to the 
current chromosome number) representing chromosome transi- 
tions (Table 2). The expected numbers of polyploidy events and 
gains and losses of single chromosomes along each branch of the 
phylogeny were recorded based on the best-fitting model. 

Using the ChromEvol software (evaluatePPDist option), we 
calculated the observed chromosome number transitions per unit 
of time that occurred relatively recently (four temporal strategies: 
tip branches, from present to 10% of total time, from present to 
25% of total time and from present to 50% of total time) and those 
that occurred deeper in time (the rest of the tree). We calculated 
the expected number of each type of chromosome number 
transition (polyploidizations plus demipolyploidizations and gains 
plus losses of single chromosomes) along the tree assuming that 
they occur homogenously over time as the null hypothesis (total 
number of events inferred along the tree divided by the total time). 
Then chi-square was used to test whether the number of observed 
transitions along external and internal branches for each temporal 
level is significandy different than the number of transitions under 
the expectation of constant transition rate along the tree (null 
hypothesis). P values smaller than 0.002 were considered 
significant to reject the null hypothesis. We selected this 
conservative P-value because our tests are two-tailed and because 
we did multiple tests and therefore Bonferroni correction is 
required. Nevertheless, P-values smaller than 0.025 were consid- 
ered as marginal support to reject the null hypothesis. 

Study of gains and losses of single chromosomes 

Gains and losses of single chromosomes encompass different 
phenomena with various expected outcomes: 1) Aneuploidy: 
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Table 1. Main features of the datasets analysed in this study. 
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*S phy : number of species sampled in the phylogeny; S counts : number of species sampled in the phylogeny with known chromosome counts; C-range: observed range of 

chromosome numbers. 
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duplication or loss of a chromosome including its DNA content, 
and 2) Dysploidy: chromosome fusion/fission that do not result in 
changes in DNA content. In addition, losing chromosomes after 
polyploidization has a different outcome than when there is not 
previous polyploidization (in the first case, although genes may be 
lost, there are extra copies of the genes). In order to differentiate 
between different patterns of gains and losses of single chromo- 
somes for lineages for which these transitions were inferred, we 
gathered genome size values from the Plant DNA C-values 
database (data.kew.org/cvalues/) and references therein. We then 
analyzed the possible correlation between genome size and 
chromosome numbers (excluding species affected by polyploidy) 
both visually and using linear models as implemented in the 
function Im of the software R [36]. These analyses were not 
performed for lineages with holocentric chromosomes (Cariceae 
and other Carex datasets) since gains and losses of single 
chromosomes in these groups is already known to be from fissions 
and fusions (dysploidy; see Supporting Information SI [39-41]). 
We also analyzed the chromosome number reconstructions 
obtained from ChromEvol to classify losses of single chromosomes 
as occurring following a polyploidization event or not. 

Results 

The studied groups comprise a high cytogenetic variability, with 
chromosome numbers ranging from n = 5 to n— 110 (Table 1). 
ChromEvol analyses (Table 2, Table SI in Supporting Informa- 
tion S2) reveal that models of chromosome evolution including 
polyploidy (12 groups) are more frequent than those which do not 
(three groups). Interestingly, demipolyploidization, associated with 
hybridization between different ploidy levels and allopolyploidiza- 
tion processes [4] , is inferred in seven of them. Gains and losses of 
single chromosomes are inferred in 12 groups: 10 of them are 
affected by single chromosome gains, nine by losses and seven by 
both events. Rates of gains and losses of single chromosomes are 
independent of the current chromosome number except for four 
groups (see inferred models in Table 2). We have found a high 



heterogeneity in chromosome transition rates both for gains and 
losses of single chromosomes (0.0013-80.9440 ^transitions per 
million year (my )) and polyploid transitions (0.0013-0.7888 
^transitions my ) (Supporting Information SI). The absolute 
number of chromosome gain and loss events inferred by our 
analyses is much higher than that of polyploidy for all groups with 
both types of transitions, with the exception of Passiflora, Erodium 
and Cistaceae datasets (Table 2). 

Our analyses (at four different temporal levels) comparing the 
expected and the observed number of transitions per time reveal 
similar results: First, the number of polyploid transitions towards 
the tips of the trees is significantly higher than expected under the 
null hypothesis (constant polyploidization rate through time) in 
four of the 12 datasets with polyploidy, in two data sets there is 
marginal support to reject the null hypothesis whereas for the 
remaining six data sets the results were congruent with it (Table 2). 
Second, gains and losses of single chromosomes were in general 
distributed much more evenly across the trees, with ten of the 12 
datasets exhibiting a distribution of gains and losses of single 
chromosomes statistically indistinguishable from expectation 
under the null hypothesis (constant chromosome gains and losses 
transition rate), whereas one exhibited a significandy higher 
number of these events towards the tips (Erodium, Table 2) and one 
(Cariceae) displayed marginal support for the latter pattern (but 
with opposite results when comparing separately gains and losses; 
results not shown). Correlation analyses between genome size and 
chromosome numbers (6 of 12 datasets; Tables S2-S7 and Figures 
S1-S6 in Supporting Information S2) and previous studies for the 
Cyperaceae revealed that gains and losses of single chromosomes 
are probably due to dysploidy for at least 10 of the 12 datasets for 
which this kind of transition was inferred (Table 2, Tables S2-S7 
and Figures S1-S6 in Supporting Information S2). Therefore, 
dysploid transitions occurring in these groups would mosdy 
correspond to fissions, fusions and genome rearrangements 
without changes in DNA content (Table 2; Supporting Informa- 
tion SI). For the remaining two datasets (Armaria and Erodium) we 
cannot differentiate between dysploidy and aneuploidy. 
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Discussion 

In this study, we have compared the expected vs. observed 
number of chromosome transitions in a temporal context following 
several alternative strategies (tips vs. rest of the chronogram tree, 
present to 10% of total time vs. rest of the chronogram tree, 
present to 25% of total time vs. rest of the chronogram tree, and 
present to 50% of total time vs. rest of the chronogram tree) 
obtaining identical results. Inferred polyploidizations were distrib- 
uted closer to the tips of the trees than expected under constant 
chromosome transition rate (null hypothesis) for half of datasets 
under all tests (Table 2). The cited pattern was significant for four 
of 1 2 datasets, and for two additional datasets the analyses suggest 
marginal support against null hypothesis. In other words, ancient 
polyploidy events are underrepresented within the half angiosperm 
clades studied here. Presuming that the rate of polyploidy is 
constant within lineages, this result suggests that polyploidy rarely 
has long-term evolutionary success [9—11,17]. Alternatively, the 
methodology implemented in ChromEvol could have lower power 
to detect ancient polyploidy events than recent ones. However, 
studies based on other approaches [9,11,17] have reached similar 
conclusions. Our results support the general pattern that recently 
originated polyploids fail to persist and diversify at lower rates 
[10]. Our data provide no obvious insight into the fact that some 
of the most extensive plant radiations have been predated by a 
polyploidization [3,9,17]. 

Gains and losses of single chromosomes may originate via many 
disparate mechanisms, which entail small changes in the 
chromosome number and changes (aneuploidy) or not (dysploidy) 
in DNA content (Supporting Information SI). Gains and losses of 
single chromosomes are widespread among angiosperms [26], and 
imply far less genomic disruption than polyploidy (it only affects 
one or a few chromosomes rather than a complete chromosome 
set and it does not necessarily entail changes in DNA content; 
Supporting information SI). However, polyploidy is much more 
studied and often viewed as the most common type of 
chromosome transition in plants and the main chromosomal 
driver of plant diversification [12]. Our results indicate that gains 
and losses of single chromosomes are also very common across 
angiosperms (12 of 15 groups) and that they may frequently co- 
occur with polyploidy within lineages (both kind of transitions 
inferred in nine of 15; Table 2). 

Our results suggest that most of the inferred gains and losses of 
single chromosomes are dysploid events, and therefore these 
transitions mostly correspond to changes in chromosome number 
without changes in DNA content. Gene balance theory predicts 
that loss or duplication of a subset of chromosomes (aneuploidy) 
should be more strongly selected against than whole-genome 
duplication [24]. In congruence, as far as we are concerned, in 
contrast to polyploidy, ancestral aneuploidy has not been inferred 
in angiosperms. Nonetheless, our results demonstrate that 
dysploidy is very frequent in angiosperms, and its effects on 
lineage diversification deserve further study. Observed dysploid 
events were not distributed closer to the tips of the phylogeny than 
expected for most datasets (Table 2). Therefore, contrary to 
polyploidy, dysploidy appears to be equally distributed early and 
late across the phylogenies examined. This is consistent with the 
hypothesis that fusion and fission events are neutral with respect to 
long-term diversification processes, neither increasing nor decreas- 
ing speciation and extinction processes substantially. 

Interestingly, holocentric chromosomes (without localized cen- 
tromere) evolve almost strictly by fission and fusion which do not 
convey changes in DNA content and those rearrangements are 
generally neutral or nearly so because of the effects of the diffuse 
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centromere [41]. Moreover, structural changes from fission and 
fusion are expected not to be underdominant [40] . Our results are 
in line with this hypothesis for the four datasets with holocentric 
chromosomes as we cannot reject the null hypothesis for any of 
them (Table 2). Additional studies are required to compare 
diversification patterns of organisms having different kinds of 
chromosomes (holocentric vs. monocentric chromosomes; Sup- 
porting Information SI). 

To sum up, our results support the hypothesis that dysploidy is 
less disadvantageous than polyploidy in terms of generating long 
term persisting lineages [17]. This is most likely because the 
inferred dysploid transitions typically do not necessarily entail 
changes in DNA content but only genome structural rearrange- 
ments (Supporting Information SI). 
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